On page 1, line 8, insert : 

-This application is a continuation of Application No. 09/629,497, 
filed July 31, 2000 , (peadatrtgi ; which is a continuation of Application 
No. 09/566,551, filed May 8, 2000^(pendiRg) ; which is a continuation of 
Application No. 09/213,243, filed December 17 , 1998 (nowU.S. Patent 
6,101,152); which is a continuation of Application No. 09/196,199, 
filed on November 20, 1998 (now U.S. Patent 6,038,195), which is a 
continuation of Application No . 08/798,520, filed on February 10 , 1997 
(now U.S. Patent 5,841,580) ; which is a division of Application No. 
08/448,657, filed May 24, 1995 (now U.S. Patent 5 , 638 , 334 ); which is a 
division of Application No. 08/222,646, filed on March 31, 1994 (now 
U.S. Patent 5,513,327); which is a continuation of Application No. 
07/954,945, filed on September 30 , 1992 (now U.S. Patent 5 , 319 , 755 ) ; 
which is a continuation of Application No . 07/510,898, filed on April 
18, 1990 (now abandoned) . 



On page 3, substitute the paragraph starting on line 6 with the 
following paragraph: 

Prior art memory systems have attempted to solve the problem of 
high speed access to memory with limited success. U.S. Pat. No. 
3,821,715 (Hoff et.al.), was issued to Intel Corporation for the 
earliest 4-bit microprocessor. That patent describes a bus connecting 
a single central processing unit (CPU) with multiple RAMs and ROMs. 
That bus multiplexes addresses and data over a 4-bit wide bus and uses 
point-to-point control signals to select particular RAMs or ROMs. The 
access time is fixed and only a single processing element in permitted. 
There is no block-mode type of operation, and most important, not all 
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of the interface signals between the devices are bused (the ROM and RAM 
control lines and the RAM select lines are point-to-point) . 

On page 6, substitute the paragraph starting on line 1, with the 
following paragraph: 



In U.S. Pat. No. 4,646,270 (Voss) , a video RAM is described which 
implements a parallel-load, serial-out shift register on the output of 
a DRAM. This generally allows greatly improved bandwidth (and has been 
extended to 2 , 4 and greater width shift-out paths.) The rest of the 
interfaces to the DRAM (RAS, CAS, multiplexed address , etc.) remain the 
same as for conventional DRAMS. 



On page 10, substitute the paragraphs starting on lines 18 and 2 



(^■"with the following two paragraphs, respectively: 




FIGS. 7a and 7b show the timing whereby signals from two devices 
can overlap temporarily and drive the bus at the same time. 

FIGS. 8a and 8b show the connection and timing between bus clocks 
and devices on the bus. 




On page 11, starting at line 14, please insert the following 
paragraph : ^ 

Figure 16 is a block diagram representation of a set of internal 
registers within each device illustrated in Figure 2. 



On page 14, substitute the paragraph starting on line 3, with the 
following paragraph: 
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with reference to FIG. 16, each semiconductor device contains a 
set of internal registers 170, preferably including a device 
identification (device register 171, a device-type descriptor register 
174, control registers 175 and other registers containing other 
information relevant to that type of device. In a preferred 
implementation, semiconductor devices connected to the bus contain 
registers 172 which specify the memory addresses contained within 
that device and access-time registers 173 which store a set of one or 
more delay times at which the device can or should be available to send 
or receive data. 



On page 14, substitute the paragraph beginning on line 13 and 
ending on page 15, line 3, with the following paragraph: 

Most of these registers can be modified and preferably are set as 
part of an initialization sequence that occurs when the system is 
powered up or reset. During the initialization sequence each device on 
the bus is assigned a unique device ID number, which is stored is the 
device ID register 171. A bus master can then use these device ID 
numbers to access and set appropriate registers in other devices, 
including access-time to registers 173, control registers 175, and 
memory registers 172, to configure the system. Each slave may have one 
or several access- time registers 173 (four in a preferred embodiment) . 
In a preferred embodiment, one access- time register in each slave is 
permanently or semi -permanently programmed with a fixed value to 
facilitate certain control functions. A preferred implementation of an 
initialization sequence is described below in more detail. 
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On page 21, substitute the paragraph starting on line 8, with the 
following paragraph: 



The data block transfer occurs later at a time specified in the 
request packet control information, preferably beginning on an even 
cycle. A device begins a data block transfer almost immediately with a 
device-internal phase as the device initiates certain functions, such 
as setting up memory addressing, before the bus access phase begins. 
The time after which a data block is driven onto the bus lines is 
selected from values stored in slave access-time registers 173. The 
timing of data for reads and writes in preferably the same; the only 
difference is which device drives the bus. For reads, the slave drives 
the bus and the master latches the values from the bus. For writes the 
master drives the bus and the selected slave latches the values from 
the bus . 



On page 34, substitute the paragraph starting on line 4, with the 
following paragraph: 

Slave devices do not need to detect a collision directly, but they 
must wait to do anything irrecoverable until the last byte (byte 5) is 
read to ensure that the packet is valid. A request packet with 
Master[0:3] equal to 0 (a retry signal) is ignored and does not cause 
a collision. The subsequent bytes of such a packet are ignored. 



Please substitute the paragraph starting on page 35, line 20, and 

ending on page 36, line 12 with the following parag raph : 

In the bus-based system of this invention, a mechanism is provided 
to give each device on the bus a unique device identifier (device ID) |/! 
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after power-up or under other conditions as desired or needed by the 
system. A master can then use this device ID to access a specific 
device, particularly to set or modify registers 170 of the specified 
device, including the control and address registers. In the preferred 
embodiment, one master is assigned to carry out the entire system 
configuration process . The master provides a series of unique device ID 
numbers for each unique device connected to the bus system. In the 
preferred embodiment, each device connected to the bus contains a 
special device- type register which specifies the type of device, for 
instance CPU, 4 MBit memory, 64 MBit memory or disk controller. The 
configuration master should check each device, determine the device 
type and set appropriate control registers, including access- time 
registers 173 . The configuration master should check each memory device 
and set all appropriate memory address registers 172 . 



On page 36, substitute the paragraph starting on line 13, with the 
following paragraph: 

One means to set up unique device ID numbers is to have each 
device to select a device ID in sequence and store the value in an 
internal device ID register 171. For example, a master can pass 
sequential device ID numbers through shift registers in each of a 
series of devices, or pass a token from device to device whereby the 
device with the token reads in device ID information from another line 



Dr lines. In a preferred embodiment, device ID numbers are assigned to 
devices according to their physical relationship, for instance, their 
order along the bus . 
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Please substitute the paragraph starting on page 38 , line 25, and 
ending on page 39, line 12, with the following paragraph: 

The configuration master should choose and set an access time in 
each access-time register 173 in each slave to a period sufficiently- 
long to allow the slave to perform an actual, desired memory-access. 
For example, for a normal DRAM access, this time must be longer than 
the row address strobe (RAS) access time. If this condition is not met, 
the slave may not deliver the correct data. The value stored in a slave 
access-time register 173 is preferably one-half the number of bus 
cycles for which the slave device should wait before using the bus in 
response to a request. Thus an access time value of "1" would indicate 
that the slave should not access the bus until at least two cycles 
after the last byte of the request packet has been received. The value 
of AccessRegO is preferably fixed at 8 (cycles) to facilitate access to 
control registers. 

Please substitute the paragraph starting on page 40, line 19, and 
ending on page 41, line 16, with the following paragraph j 

In a preferred embodiment, a standard data block size can be 
selected for use with ECC, and the ECC method will determine the 
required number of bits of information in a corresponding ECC block. 
RAMs containing ECC information can be programmed to store an access 
time that in equal to: (1) the access time of the normal RAM 
(containing data) plus the time to access a standard data block (for 
corrected data) minus the time to send a request packet (6 bytes) ; or 
(2) the access time of a normal RAM minus the time to access a standard 
ECC block minus the time to send a request packet. To read a data block 
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and the corresponding ECC block, the master simply issues a request for 
the data immediately followed by a request for the ECC block. The ECC 
RAM will wait for the selected access time then drive its data onto the 
bus right after (in case (1) above) ) the data RAM has finished driving 
out the data block. Persons skilled in the art will recognize that the 
access time described in case (2) above can be used to drive ECC data 
before the data is driven onto the bus lines and will recognize that 
writing data can be done by analogy with the method described for a 
read. Persons skilled in the art will also recognize the adjustments 
that must be made in the bus-busy structure and the request packet 
arbitration methods of this invention in order to accommodate these 
paired ECC requests. 



Please substitute the paragraph starting on page 45, line 17, and 
ending on page 46, line 17, with the following: 



Referring to FIGS. 7a and 7b, although there is no stable 
condition where two devices drive the bus at the same time, conditions 
can arise because of propagation delay on the wires where one device, 
A 41, can start driving its part of the bus 44 while the bus is still 
being driven by another device, B 42 (already asserting a logical 1 on 
the bus) . In a system using current drivers, when B 42 is driving the 
bus (before time 46) , the value at points 44 and 45 is logical 1. If B 
42 switches off at time 46 just when A 41 switches on, the additional 
drive by device A 41 causes the voltage at the output 44 of A 41 to 
briefly below the normal value. The voltage returns to its normal value 
at time 47 when the effect of device B 42 turning off is felt. The 
voltage at point 45 goes to logical 0 when device B 42 turns off, then 
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drops at time 47 when the effect of device A 41 turning on is felt. 
Since the logical 1 driven by current from device A 41 is propagated 
irrespective of the previous value on the bus, the value on the bus is 
guaranteed to settle after one time of flight (tf) delay, that is, the 
time it takes a signal to propagate from one end of the bus to the 
other. If a voltage drive was used (as in ECL wired-ORing) , a logical 
1 on the bus (from device B 42 being previously driven) would prevent 
the transition put out by device A 41 being felt at the most remote 
part of the system, e.g. , device 43, until the turnoff waveform from 
device B 42 reached device A 41 plus one time of flight delay, giving 
a worst case settling time of twice the time of flight delay. 



Please substitute the paragraph starting on page 46, line 20, and 
ending on page 47, line 12, with the following paragraph : 



Clocking a high speed bus accurately without introducing error due 
to propagation delays can be implemented by having each device monitor 
two bus clock signals and then derive internally a device clock, the 
true system clock. The bus clock information can be sent on one or two 
lines to provide a mechanism for each bused device to generate an 
internal device clock with zero skew relative to all the other device 
clocks. Referring to FIG. 8a, in the preferred implementation, a bus 
clock generator 50 at one end of the bus propagates an early bus clock 
^'signal in one direction along the bus, for example on line 53 from 
right to left, to the far end of the bus. The same clock signal then is 
passed through the direct connection shown to a second line 54, and 
returns as a late bus clock signal along the bus from the far and to 
the origin, propagating from left to right. A single bus clock line can 
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be used if it is left unterminated at the far end of the bus, allowing 
the early bus clock signal to reflect back along the same line as a 
late bus clock signal . 



Please substitute the paragraph starting on page 49, line 12, and 
ending on page 50, line 3, with the following paragraph: 



Referring to FIG. 9, each primary bus unit can be mounted on a 
single circuit board 66, sometimes called a memory stick. Each 
transceiver device 19 in turn connects to a transceiver bus 65, similar 
or identical in electrical and other respects to the primary bus 18 
described at length above. In a preferred implementation, all masters 
are situated on the transceiver bus so there are no transceiver delays 
between masters and all memory devices are on primary bus units so that 
all memory accesses experience an equivalent transceiver delay, but 
persons skilled in the art will recognize how to implement systems 
which have masters on more than one bus unit and memory devices on the 
transceiver bus as well as on primary bus units. In general, each 
teaching of this invention which refers to a memory device can be 
practiced using a transceiver device and one or more memory devices on 
an attached primary bus unit. Other devices, generically referred to as 
peripheral devices, including disk controllers, video controllers or 
I/O devices can also be attached to either the transceiver bus or a 
primairy bus \init, as desired. Persons skilled in the art will recognize 
how to use a single primary bus unit or multiple primary bus units as 
needed with a transceiver bus in certain system designs. 
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Please substitute the paragraph starting on page 53, line 24, and 
ending on page 54, line 22, with the following paragraph: 




A block diagram of the preferred input/output circuit for 
address/data/control lines is shown in FIG. 10. This circuitry is 
particularly well-suited for use in DRAM devices but it can be used or 
modified by one skilled in the art for use in other devices connected 
to the bus of this invention. It consists of a set of input receivers 
71, 72 and output driver 76 connected to input /output line 69 and pad 
75 and circuitry to use the internal clock 73 and internal clock 
complement 74 to drive the input interface. The clocked input receivers 
take advantage of the synchronous nature of the bus . To further reduce 
the performance requirements for device input receivers, each device 
pin, and thus each bus line, is connected to two clocked receivers, one 
to sample the even cycle inputs, the other to sample the odd cycle 
inputs. By thus de-multiplexing the input 69 at the pin, each clocked 
amplifier is given a full 2 ns cycle to amplify the bus low-voltage- 
swing signal into a full value CMOS logic signal. Persons skilled in 
the art will recognize that additional clocked input receivers can be 
used within the teachings of this invention. For example, four input 
receivers could be connected to each device pin and clocked by a 
.f-^modif ied internal device clock to transfer sequential bits from the bus 
to internal device circuits, allowing still higher external bus speeds 
or still longer settling times to amplify the bus low-voltage- swing 
signal into a full value CMOS logic signal. 



Please substitute the paragraph starting on page 55, line 17, and 
ending on page 56, line 6, with the following paragraph: 
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The input receivers of every slave must be able to operate during 
every cycle to determine whether the signal on the bus is a valid 
request packet. This requirement leads to a number of constraints on 
the input circuitry. In addition to requiring small acquisition and 
resolution delays, the circuits must take little or no DC power, little 
AC power and inject very little current back into the input or 
reference lines. The standard clocked DRAM sense amp shown in FIG. 11 
satisfies all these requirements except the need for low input 
currents. When this sense amp goes from sense to sample, the 
capacitance of the internal nodes 83 and 84 in FIG. 11 is discharged 
through the reference line 68 and input 69, respectively. This 
particular current is small, but the sum of such currents from all the 
inputs into the reference lines summed over all devices can be 
reasonably large. 

Please substitute the paragraph starting on page 58, line 13, and 
ending on 59, line 2, with the following paragraph: 

In the preferred embodiment, two sets of these delay lines are 
used, one to generate the true value of the internal device clock 73, 
and the other to generate the complement 74 without adding any inverter 
delay. The dual circuit allows generation of truly complementary 
clocks, with extremely small skew. The complement internal device clock 
is used to clock the "event" input receivers to sample at time 127, 
while the true internal device clock is used to clock the "odd" input 
receivers to- sample at time 125. The true and complement internal 
device clocks 73 and 74 are also used to select which data is driven to 
the output drivers.' The gate delay between the internal device clock 
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and output circuits driving the bus is slightly greater than the 
corresponding delay for the input circuits, which means that the new 
data always will be driven on the bus slightly after the old data has 
been sampled. 
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Please substitute the paragraph starting on page 60, line 1, and 
ending on page 61, line 8, with the following paragraph: 

Running the internal I/O lines in the conventional way at high bus 
cycle rates is not possible. In the preferred method, several 
(preferably 4) bytes are read or written during each cycle and the 
coliimn access path is modified to ran at a lower rate (the inverse of 
the number of bytes accessed per cycle, preferably 1/4 of the bus cycle 
rate) . Three different techniques are used to provide the additional 
internal I/O lines required and to supply data to memory cells at this 
rate. First, the number of I/O bit lines in each subarray running 
through the column decoder 147 A, B is increased, for example, to 16, 
eight for each of the two columns of column sense amps and the column 
decoder selects one set of columns from the "top" half 148 of subarray 
150 and one set of columns from the "bottom" half 149 during each 
cycle, where the column decoder selects one column sense amp per I/O 
bit line. Second, each column I/O line is divided into two halves, 
carrying data independently over separate internal I/O lines from the 
left half 147A and right half 147B of each subarray (dividing each 
subarray into quadrants) and the column decoder selects sense amps from 
each right and left half of the subarray, doubling the number of bits 
available at each cycle. Thus each column decode selection turns on n 
column sense amps, where n equals four (top left and right, bottom left 
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